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Abstract 

We report a measurement of the top quark mass, m t , obtained from pp collisions at y/s = 
1.96 TeV at the Fermilab Tevatron using the CDF II detector. We analyze a sample corresponding 
to an integrated luminosity of 1.9 fb _1 . We select events with an electron or muon, large missing 
transverse energy, and exactly four high-energy jets in the central region of the detector, at least 
one of which is tagged as coming from a b quark. We calculate a signal likelihood using a matrix 
element integration method, where the matrix element is modified by using effective propagators to 
take into account assumptions on event kinematics. Our event likelihood is a function of mt and a 
parameter JES that determines in situ the calibration of the jet energies. We use a neural network 
discriminant to distinguish signal from background events. We also apply a cut on the peak value 
of each event likelihood curve to reduce the contribution of background and badly reconstructed 
events. Using the 318 events that pass all selection criteria, we find m t = 172. 7± 1.8 (stat. + JES) 
±1.2 (syst.) GeV/c 2 . 

PACS numbers: 14.65.Ha 
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I. INTRODUCTION 



The top quark mass, m t} is an important parameter in the standard model of particle 
physics. Since the discovery of the top quark in 1995, there have been many reported mea- 
surements of its mass, all from the CDF and DO experiments at the Fermilab Tevatron 
The standard model relates the top quark and W boson masses to the mass of the pre- 
dicted Higgs boson via loop corrections. Precision measurements of m t and the W boson 
mass mwi i n conjunction with many other precision electroweak measurements, thus provide 
constraints on the value of the Higgs boson mass Q]. 

The measurement reported here uses pp collision data corresponding to an integrated 
luminosity of 1.9 fb _1 , collected by the CDF II detector during Run II of the Fermilab 
Tevatron collider at y/s = 1.96 TeV. In pp collisions, top quarks are produced predominantly 
as ti pairs, and present measurements within the standard model framework indicate that 

n 

the top quark decays to a W boson and a b quark nearly 100% of the time |3j. The W 
boson can decay into either a charged lepton and a neutrino ( "leptonic decay" ) or a quark- 
antiquark pair ("hadronic decay"). We select events in which one of the W bosons decays 
leptonically and the other decays hadronically, where the lepton in the leptonic decay is 
required to be an electron or muon; this channel is referred to as the "lepton + jets" channel. 
Decays of W bosons into a tau lepton are not explicitly included in our model, although 
some events containing a tau lepton which decays into an electron or muon do pass our 
selection criteria and amount to approximately 7% of the ti signal. In general, an event in 
our candidate sample has four high-energy jets (two of which come from the parton shower 
and hadronization associated with the quarks from the hadronic W boson decay and two 
from the parton shower and hadronization of the b quarks), a charged electron or muon, 
and an unobserved neutrino. For a given Tevatron integrated luminosity, the lepton + jets 
channel allows for more precise measurements than channels in which both W bosons decay 
leptonically or hadronically, as it offers the best balance of available statistics and sample 
purity. The most recent mt measurements obtained at the Tevatron using the lepton + jets 
topology are reported in Ref. (J). 

The method we use to extract the top quark mass from a sample of candidate ti events 
is a modified matrix element integration method. The matrix element approach to the 
top quark mass measurement jlj] is based on integrating over the tree-level phase space of 
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the process, where each kinematic configuration of the tree-level partons is weighted by the 
matrix element squared and by the probability that the detector observables can be produced 
by the final state particles. With an appropriate normalization factor, this integral defines 
the probability to see an event with this configuration in the detector. By multiplying the 
individual event probabilities, we obtain a likelihood, as a function of m t , of seeing the event 
sample observed in our detector. 

In theory, the distributions of the invariant masses of the top quark and the W boson 
decay products are dominated by propagator-induced terms in the matrix element squared. 
The top quark and the W boson widths are relatively narrow in comparison with their 
respective masses. This leads to nearly pure relativistic Breit-Wigner distributions for the 
invariant masses of their decay products. Compared to this theoretical prediction, finite 
resolution of the detector measurement naturally results in a widening of the observed dis- 
tributions. We describe the widening due to an imperfect measurement of magnitudes of 
jet momenta in terms of detector transfer functions. Effects of other uncertainties, such as 
finite angular resolutions, are modeled by replacing the Breit-Wigner terms in the matrix 
element squared with empirically determined distributions called "effective propagators" in 
this paper. This modification of the matrix element improves the observation model for ti 
events. 

The matrix element used in this work includes ti production, from both quark- 
antiquark and gluon-gluon collisions, and decay into the lepton + jets channel. Since we 
do not know which jet observed in our detector corresponds to which parton in the matrix 
element, we calculate the likelihood for each possible assignment of jets to partons and sum 
the likelihood over all permutations. Each permutation includes a weight, which takes into 
account the probability that the permutation is consistent with the observed information on 
whether the jet has been tagged or not as a 6-jet. Tagging of 6-jets is done by the displaced 
vertex technique discussed in Section IIHI For each permutation, the matrix element inte- 
gration is performed over the seven kinematic variables in the event that remain after a set 
of simplifying assumptions. 

We improve the precision of the method by introducing another parameter into our 
likelihood, the jet energy scale (JES). This is a scale factor which multiplies the energy of all 
jets. The uncertainty on the jet energy scale is the major source of systematic uncertainty 
on the top mass measurement; by including JES as a parameter in the likelihood, we can 
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use the W boson decay to hadrons in the tt decay chain to provide in situ calibration, thus 
reducing the systematic uncertainty due to JES. Our final likelihood calculated for each 
candidate event is thus a function of both the top quark mass and JES. 

Our model is designed to fit lepton+jets tt events where the final objects observed in the 
event come directly from tt decays. We thus need to take into account non-tt events or tt 
events where some of the observed objects do not originate from tt decay. The probability 
that a tt candidate event is background is estimated using a neural network output that is 
a function of several shape and kinematic variables, and then their expected contribution to 
the likelihood is subtracted from the total likelihood. In addition, we cut on the magnitude 
of the peak of the likelihood for an event to further reduce background and badly modeled 
events. 

We multiply the individual likelihood curves from each event to get an overall likelihood. 
Because of the assumptions made, and the presence of background events, the extraction 
of a mass value and its uncertainty needs a calibration, which we obtain from Monte Carlo 
simulated events. 

Section [III is a brief description of the CDF II detector and its use for the measure- 
ments needed in this analysis. Section II III defines the data sample used for this analysis 
and the estimated background. Section [IV] describes the likelihood construction for tt signal 
events. Section [V] explains how non-tt events are incorporated into the likelihood function. 
Section IVII describes how the method is tested and calibrated. Section IVHI covers the sys- 
tematic uncertainties. Section IVIIII summarizes the results obtained by applying the method 
to the data. Finally, Section [IX] gives the conclusions. 



II. THE CDF II DETECTOR 



A complete description of the CDF II detector and its use in lepton, jet, and secondary 
vertex reconstruction can be found elsewhere [7|. Here, we describe the components that 
are essential for this analysis and how they are used. 

The CDF II detector is a general-purpose detector with a cylindrical geometry featur- 
ing forward-backward symmetry and axial symmetry around the beam pipe. The CDF 
coordinate system uses a cylindrical system centered at the interaction point with the z 
(longitudinal) axis along the proton beam direction, r the distance to the beam line, and </> 
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the azimuthal angle around the beam line. We also use 6, the polar angle from the beam 
line. The pseudorapidity rj of a particle three-momentum is defined in terms of the polar 
angle 9 by 7] = — ln(tan(#/2)). For a particle with momentum p and energy E, we define the 
transverse momentum p? and the transverse energy Et as psm.9 and Esm.6, respectively. 
The detector covers the complete solid angle in 9 and up to \rj\ = 3.6. 

The innermost part of the detector consists of the charged particle tracking detectors, 
which are immersed in a 1.4 T magnetic field provided by a superconducting solenoid oriented 
parallel to the beam axis. Calorimeters and muon systems outside the solenoid provide 
lepton measurement and identification in addition to jet momentum measurements. The 
tracking detectors and calorimeters together provide identification of jets from heavy (charm 
and bottom) quarks. 

The first component of the tracking system is a series of silicon microstrip detectors 
between radii of 1.5 and 28 cm. The innermost layer (LOO) is a single-sided layer of 
silicon attached directly to the beam pipe, providing a position measurement very close to 
the collision point. Five layers of double-sided microstrip detectors (SVXII) cover up to 
r = 10.6 cm in the \rj\ < 1.0 region [9]. Each layer has one side with strips oriented parallel 
to the beam axis to provide measurements in the r-<p plane and one side at a stereo angle 
to provide three-dimensional measurements; two layers have strips at a 90° angle and three 
layers have strips at a 1.2° angle. The ISL [10] is an additional set of silicon microstrip 
detectors located outside of SVXII to provide measurements at larger distances from the 
beam line, thus improving the silicon tracking. It consists of one layer at r = 22 cm in 
the central region (\rj\ < 1.0) and two layers at r = 20 cm and r = 28 cm in the forward 
region (1.0 < \r)\ < 2.0). The typical resolution of these detectors in the r—(j) plane is 11 /an. 
The impact parameter resolution of this system is a(do) ~ 40 /im, of which approximately 
35 /im is due to the transverse size of the Tevatron interaction region. Outside of the silicon 
layers lies the central outer tracker (COT) Hj, an open-cell drift chamber detector, which 
provides coverage for \rj\ < 1.0. Multiple wire planes, each with 12 sense wires, are grouped in 
8 superlayers which extend to a radius of 137 cm. The superlayers alternate between having 
wires parallel to the beam axis and wires skewed by a ±2° stereo angle, thus providing up to 
96 points for track reconstruction. Together with the additional constraint coming from the 
primary vertex position, these tracking elements provide a resolution on the track transverse 
momentum, p?, of ct(pt)/pt ~ 0.1% -pr/(GeV/c). 
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Outside the tracking system and the solenoid are segmented electromagnetic and hadronic 
calorimeters. The central calorimeter covers up to < 1.1 and has a projective geometry 
consisting of towers segmented in r\ and <j) pointing toward the center of the detector. The 
central electromagnetic calorimeter (CEM) consists of alternating layers of lead plates and 
plastic scintillators, 18 radiation lengths deep. The energy detected in small contiguous 
groups of calorimeter towers is summed into electromagnetic clusters. These clusters are 
identified as electron candidates if they match a track reconstructed in the tracking sys- 
tem and if very little energy is detected in the surrounding towers (i.e., if the cluster is 
isolated). The electron energy resolution for an electron with tranverse energy Ex is given 



by <j(Et)/Et ~ 13.5%/ y-Er/GeV © 2%. Approximately at shower maximum are propor- 
tional strip and wire chambers (CES) which provide finer position resolution for electron 
and photon identification. The central hadronic calorimeter (CHA) is composed of alternat- 
ing layers of iron plates and scintillators, 4.5 nuclear interaction lengths deep, again with 
projective geometry segmentation. A plug tile calorimeter covers the forward region with 
1.1 < \rj\ < 3.6, consisting of a lead/scintillator electromagnetic portion (PEM), scintillator 
strips at shower maximum (PES), and an iron/scintillator hadronic portion (PHA). An ad- 
ditional hadronic calorimeter (WHA) covers the region between the plug calorimeter and the 
central calorimeter and improves the hermeticity of the detector. These calorimeters provide 
jet measurements with a resolution of approximately ct(Et) ~ 0.1 ■ + 1.0 GeV 121 ] . 

In the central and forward regions, jets are reconstructed with a cone algorithm 12], 
which adds groups of electromagnetic (£em) an d hadronic clusters (.Eh ad) that fall within 
a cone of radius AR = a/ A</> 2 + Arj 2 < 0.4 around a seed tower with energy of at least 
1 GeV. The jet energies are corrected for multiple primary interactions (pileup) and for 
detector effects including a calibrated non-linearity in the calorimeter and average losses 
in non-sensitive regions of the calorimeter. The jet energies are also corrected for hadronic 
physics effects. Soft hadroproduction in the underlying event tends to increase the measured 
jet energy, while the limited cone size of the jet clustering algorithm gives rise to out-of- 
cone losses jl3j ]. Uncertainties for each of these corrections contribute to the jet systematic 
uncertainty, and are used to assign an uncertainty on the top quark mass. 

In the lepton + jets channel one of the W bosons decays into a lepton and a neutrino, 
which escapes undetected. This results in less energy being measured in our detector than 
we would otherwise expect. We require this as a signature for tt events. Specifically, we 
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define a quantity, the missing Et {fir) , to measure the resulting transverse energy imbalance 
as follows: 



E T =\ E Tin Ti \, (1) 

i 6 towers 

where is the unit vector in the x-y plane pointing from the primary vertex to a given 
calorimeter tower and is the uncorrected Et measured in that tower. Two additional 
corrections to this quantity are made. One is to account for muons, which, unlike other 
particles, typically deposit only a small fraction of their energy in the towers, and the other 
is to take into account the corrections applied to the raw energies of the jets. Details of 
these corrections can be found in Ref. fl4]. 

Muon identification takes place in three separate subdetectors. Two of these are in the 
central region: one set of four layers of drift chambers (CMU) located outside the central 
calorimeters (after 4.6 hadronic absorption lengths of material), and another set of four 
layers (CMP) located outside the magnet return yoke, which provides an additional 60 cm 
of absorbing steel. Muon tracks in this region are required to pass through both detectors 
and are called CMUP muons. These two subdetectors cover the region \rj\ < 0.6. Muons in 
the region 0.6 < \r)\ < 1.0 are detected by an additional set of four layers of drift chambers 
(CMX), completing the full fiducial region of the COT. CMUP or CMX track segments are 
matched to tracks in the COT; in addition, the energy deposited in the CEM and CHA is 
required to be small. 

The trigger system is used to record events with high-p^ leptons. The trigger is a three- 
level filter in which the first two levels use specialized hardware and utilize only the detector 
subsystems with fast readout. The third level is a complete reconstruction of the event 
using the same software used for the offline reconstruction, but with less stringent cuts. 
The level 1 (LI) trigger uses information from the calorimeter clusters and from the XFT 
(eXtremely Fast Tracker), which reconstructs tracks from the COT r—<p information with a 
momentum resolution given by <j(pt)/pt ~ 2% -^/(GeV/c) [l^. The LI central electron 
trigger requires a track with pt > 8 GeV/c pointing to a tower with Et > 8 GeV and 
-Ehad/^em < 0.125. The L2 trigger adds clustering in the CEM calorimeter and requires 
that a cluster with Et > 16 GeV matches with a pt > 8 GeV/c track. The LI and L2 muon 
triggers require a track with p T > 4 GeV/c (CMUP) or p T > 8 GeV/c (CMX) pointing to 

13 



a track segment in the respective drift chamber system. A complete lepton reconstruction 
is performed in the L3 trigger, where Et > 18 GeV is required for electrons and pt > 18 
GeV/c is required for muons. 



III. DATA SAMPLE AND BACKGROUND 

As mentioned previously, we search for events in the lepton + jets topology, where a ti pair 
is produced, each top quark decays into a W boson and a b quark, and one W boson decays 
leptonically and one hadronically. We thus identify our top quark candidates by selecting 
events with four high-energy jets, a high-energy electron or muon, and Et from a neutrino. 
Specifically, we require either an electron with Et > 20 GeV or a muon with pt > 20 GeV/c 
in the central region (\rj\ < 1.0) of the detector. Electron and muon identification criteria 
are discussed in Ref. [2). For the neutrino, we require Et > 20 GeV in the event. We require 
exactly 4 jets with Et > 20 GeV and pseudorapidity \rj\ < 2.0. The jet Et is corrected for 
pileup, inhomogeneities of the detector, and nonlinear calorimeter response as a function of 



jet pt and r\ [13|. The additional corrections (underlying event and out-of-cone losses) are 
not used in the analysis, but their uncertainties are taken into account in evaluating the 
systematic uncertainties on the final result. 

Non-tt events that contain a W boson and four jets are able to pass the aforementioned 
selection cuts. However, most of these events do not contain b quarks in their final state, 
while ti events will nearly always have two b quarks. The b quarks from top quark decay 
hadronize into B-hadrons with energies on the order of several tens of GeV, due to the high 
mass of the parent top quark. Since the B hadron decay time is approximately 1.5 ps, it is 
possible to reconstruct secondary vertices within a jet using the charged particles from the 
B decay [16(. A jet with an identified secondary vertex is called a 6-tagged jet. Therefore, 
to further increase the ti purity of the sample, we require that at least one of the jets must 
be tagged as a fe-jet using a secondary vertex tagging algorithm. 

The outline of the 6-tagging algorithm used, secvtx, is as follows: first, the charged 
particle tracks in the jet are subjected to selection cuts to ensure that a quality secondary 
vertex can be reconstructed. There must either be at least three tracks with pt > 0.5 GeV/c 
where at least one of the tracks must be > 1 GeV/c, or at least two tracks with pt > 1 
GeV/c. Once the secondary vertex is reconstructed using the tracks, the distance in the 
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x—y plane between the primary and secondary vertices is projected onto the direction of the 
jet; this quantity is referred to as L 2 d (see Fig. [p. A jet is tagged if L 2 d > 7.5 o"l 2D , where 
<7l 2D , the uncertainty on L 2D , is approximately 190 fim. For 6-jets produced in ti decay, the 
6-tagging efficiency is about 40%, while light jets are misidentified as 6-jets with a rate of 



less than 2%. For more details see Ref. 



16] . 




Jet axis 



Primary vertex 



FIG. 1: A view of a 6-jet in the x—y plane. L 2 d, the distance between the primary and secondary 
vertices projected onto the jet axis, along with its uncertainty, is used to determine whether a jet 
originates from a heavy flavor quark. 

In 1.9 fb _1 of data we find 371 tt candidate events that pass the above selection require- 
ments, 284 of which have one 6-tag and 87 of which have more than one 6-tag (see Tabled]); 
207 of the candidate events contain an electron and 164 contain a muon. The background 
to the tt signal consists of three main sources: a) events where a W boson is produced in 
conjunction with heavy flavor quarks (bb, cc, or c); b) events where a W boson is produced 
along with light flavor quarks where a light flavor jet has been incorrectly tagged with a 
6-tag (mistag); c) QCD events, which do not contain a W boson (non-IF events) but have 
a jet mimicking a lepton, a jet with a 6-tag, and $t- There are also smaller contributions 
from single top quark production, diboson (WW, WZ, or ZZ) production, and Z + jets 
production. The estimated number of background events for each of these sources is derived 
with the method used for the cross section measurement [17| . 
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The contributions for the various types of background shown in Table [I] are estimated 
as follows. First, we define a pretag event sample, which comprises all events that pass all 
the signal selection requirements except for the 6-tag requirement; our final tagged samples 
are thus subsets of the pretag sample. For all samples, the expected number of events for 
diboson, Z + jets, and single top quark backgrounds, as well as the tt signal, are estimated 
using Monte Carlo (MC) simulated events assuming the theoretical cross sections. To sim- 
ulate the signal, we generate tt events at a variety of top quark masses from 152 GeV/c 2 
to 190 GeV/c 2 (needed for our top mass analysis) using the pythia MC generator version 



6.216 



18] . As a cross-check we also use tt signal events generated with the herwig generator 



version 6.510 [19j. For the number of expected tt events used in the background estimate 
we use a top mass of 175 GeV/c 2 , with a calculated tt production cross section of 6.7 ± 0.8 



pb 



20|. 



The jioyi-W contribution is estimated by a fit to the observed $t distribution of expected 
$t distributions for non-W events (which lie mostly in the low fix region) and jy+jets 
events. These distributions are taken from data sidebands (either events with leptons which 
fail to meet the isolation requirements, or from "antielectron" samples, which are electron 
candidates failing two other selection requirements) and from simulated MC events. This 
fit is performed separately for the pretagged and tagged samples to obtain the expected 
number of events in each. 

The W + jets background contribution to the pretag sample is taken as the remainder 
after subtracting all the above pretag contributions. The relative contribution of W + heavy 
flavor events to the pretag W + jets contribution is estimated with MC simulation. We use 



MC events generated with the alpgen [21J generator (version 2.10 prime) along with pythia 
version 6.325 to perform the parton shower and hadronization. The alpgen program is used 
to generate samples with specific numbers of partons in the matrix element; this decreases 
the time to generate events with high jet multiplicity. Since each sample contains a different 
number of partons (for instance, the W + bb sample contains W + bb + Op, W + bb + lp, 
and W + bb+ > 2p contributions), we combine the separate samples using their expected 
fractions, and remove overlaps using the alpgen jet-parton matching along with a jet-based 
heavy flavor overlap removal algorithm 17|. Finally, applying heavy flavor and light flavor 
fe-tag efficiencies we obtain the estimated W + jets contributions to the final sam ple. 

The single top quark contribution is generated using the MadGraph/MadEvent 22] pack- 



16 



age along with pythia for the parton shower and hadronization. Since their expected con- 
tribution is small, we do not use separate MC samples for diboson or Z+jets backgrounds, 
but rather merge them into the W + light flavor total. All MC samples are simulated using 
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the CDF II detector response simulation package 

Table [I] summarizes the data sample composition as a function of the number of tagged 
jets in the event. The total number of expected background events in our data sample is 
iVbg = 70.3 ± 16.5 out of 371 observed events. 

TABLE I: Summary of observed data and predicted ti signal and background contributions as a 
function of 6-tags in the event. 



Background 


1 tag 


> 2 tags 


non-lT QCD 


13.8 ± 11.5 


0.5 ± 1.5 


W + light flavor (mistag) 


16.3 ± 3.6 


0.3 ± 0.1 


Diboson (WW, WZ, ZZ), Z + jets 


5.5 ± 0.4 


0.5 ± 0.1 


W + bb, cc, c 


26.1 ± 10.2 


3.4 ± 1.4 


Single top 


3.0 ± 0.2 


0.9 ± 0.1 


Total background 


64.7 ± 16.3 


5.5 ± 2.6 


Predicted ti signal 


182.6 ± 24.6 


69.4 ± 11.2 


Events observed 


284 


87 



We test our background model by comparing selected kinematic distributions in the data 
with those expected by our model, by adding the MC samples used for the ti signal and 
backgrounds and the data samples used for the QCD background according to their predicted 
proportions. 

Figures [2] and [3] show the comparisons. All of these plots require exactly 4 jets with E? > 
20 GeV and \rj\ < 2.0, but in Fig. [3], we also show the number of jets with lower energies for an 
additional comparison between data and MC. For each quantity, we perform a Kolmogorov- 
Smirnov (K-S) test comparing the data and MC samples; in all cases, the resulting confidence 
level shows a good agreement, which validates the use of the MC generators and the QCD 
background used in this analysis. 



17 



1st jet E T 



2nd jet E T 



o 0.18- 

fo.16F 

1 0.14 
UJ 

0.12 
0.1 
0.08 
0.06 
0.04 
0.02 




1.9 fb" 1 data 
tt signal (172 GeV/c 2 ) 
iiiiiiiiiiiil W + light mistag 
W + heavy flavor 
single top 
I \3 non-W QCD 




K-S CL = 0.39 



50 



100 



150 



200 250 
E T (GeV) 



1 .9 fb 1 data 
tt signal (1 72 GeV/c 2 ) 
4" EMMM W + light mistag 
W + heavy flavor 
single top 
| ^ non-W QCD 




K-S CL = 0.10 



80 100 120 140 160 180 
E T (GeV) 



3rd jet E T 



• 1.9 fb 1 data 

I tt signal (172 GeV/c 2 ) 
I W + light mistag 
W + heavy flavor 
single top 
| ^ non-W QCD 




^0 40 60 80 '100 120 

E T (GeV) 



4th jet E T 



i 07 



£ 0.6 



s 

UJ 0.5 



0.4 
0.3 
0.2 
0.1 



1 .9 fb 1 data 
I tt signal (172GeV/c 2 ) 
t: : I-ii: : II W + light mistag 
W + heavy flavor 
single top 
j 1 non-W QCD 



K-S CL = 0.95 



20 30 40 50 60 " 70 ~ 80 90 

E T (GeV) 



FIG. 2: Comparison of data and MC predictions for the selected events. The confidence level 
obtained from a K-S test on the two distributions is indicated on the histogram. The plots show, 
in order, the corrected Et of the leading jet, 2nd jet, 3rd jet, and 4th jet in our events. 



IV. SIGNAL LIKELIHOOD 



The matrix element method allows for efficient incorporation of the theoretical assump- 
tions about the process under study into the data analysis. The phase space integration 
procedure can be viewed as a Bayesian marginalization of the event probability over all 
unobserved degrees of freedom. Particle theory provides a well-motivated informative prior 
for this marginalization. Maximization of the likelihood with respect to the measured pa- 
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FIG. 3: Comparison of data and MC predictions for the selected events. The confidence level 
obtained from a K-S test on the two distributions is indicated on the histogram. The plots show, 
in order, the event Et-, the total number of jets with Et > 12 GeV and |r/| < 2.4 in the event, the 
Et for all jets with a 6-tag, and the lepton pa- 



rameters results in an efficient (in the statistical sense) estimate of these parameters. 

Instead of attempting to integrate over the complete phase space of the process which 
can include hundreds of particles, we assume that the final state parton showering and 
hadronization processes, together with the detector response, can be modeled empirically 
by the transfer functions. This assumption allows for a drastic reduction in the phase 
space complexity. However, calculation of the remaining tree-level phase space integrals still 
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remains a formidable problem when a large number of events and parameter values must be 
processed. Therefore, we employ additional assumptions, detailed later in this section, to 
reduce the dimensionality. To compensate for these assumptions, we introduce the concept 
of "effective propagators" that modify the tree-level matrix element of the interaction. 

For each event we obtain a tt signal probability as a function of the top quark pole mass 
(m t ) and the jet energy scale (JES) using the following expression: 



L(V ™" JES) = N(m t )A(ml JES) * 

24 

f> / f{Zl pp Z2) TF(y • JES | x) \M eS (m t ,x)\ 2 (2) 
i=i J 

where y are the quantities we measure in the detector (the momenta of the charged lepton 
and all the jets); x are the parton- level quantities that define the kinematics of the event; 
N(mt) is an overall normalization factor; A(m t , JES) is the event acceptance as a function 
of mt and JES; f(z±) and f(z2) are the parton distribution functions (PDF's) for incoming 
parton momentum fractions z\ and Zi] FF is the relativistic invariant flux; TF(y • JES | x) 
are the transfer functions that predict the measured jet momenta distributions given the 
quark kinematics; d$(x) indicates integration over the phase space of the two initial and 
six final state partons and leptons in the tt production and decay (including necessary 
Jacobians); and M e g(mt, x) is the modified matrix element for ti production and decay. The 
PDFs, f{z{) and f{z2), are integrated over the appropriate combinations of incoming qq and 



gluons. We use the CTEQ5L PDFs 24J in our integration. The integral is calculated for 
each of the 24 possible permutations of jet-parton assignment and then summed with the 
appropriate weights Wi, where the weights are determined by the 6-tagging information on 
the jets. Specifically, for each tagged jet in the event, a weight equal to the tag rate of the 
jet is given if it is assigned to a b parton, and a weight equal to the mistag rate is given if 
it is assigned to a light parton. An untagged jet is given a weight of 1 minus the tag rate if 
assigned to a b parton, and 1 minus the mistag rate if assigned to a light parton. The four 
individual jet weights are then multiplied. Figure H] shows the parameterizations of the tag 
rates as a function of jet Ft and \r\ used to determine the Wi values; we assign a probability 
for a c jet to be tagged equal to 0.22 times the probability for a 5-jet with the same Ej 1 
and |^| . These values are derived from Monte Carlo events and then corrected with a scale 
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factor measured in data to account for differing tag rates in Monte Carlo events and data 
events. 



b-jet tagging efficiency vs. jet E T 
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FIG. 4: Efficiency for the secvtx algorithm with systematic uncertainties. The top two plots 
show the tag efficiency for tagging b-jets as a function of the jet Et (left) and \rj\ (right), and the 
bottom two plots show the tag rate for light jets (mistags), also as a function of jet Et (left) and 
|?7| (right). The fits used as a parameterization in our analysis are also shown. 

We begin with the Kleiss-Stirling matrix element Jfj], which includes both qq — > ti and 
gg — > ti production processes, as well as all spin correlations. The integral formula in Eq. [2] 
requires a 24-dimensional integration (eight four-vectors are needed to describe the reac- 
tion, but energy-momentum conservation together with the negligible masses of the initial 
partons and the final state leptons allow for a trivial phase space dimensionality reduction 
to 24). This is computationally difficult to evaluate, so we make the following simplifying 
assumptions: the lepton direction and momentum are perfectly measured; the directions of 
the partons coincide with the measured jet directions; the light quark masses are zero, the 
b quark from the hadronic top quark is on mass shell, and the b quark from the leptonic 
top quark has zero mass. The last assumption results in a simplification of the kinematic 
equation on the leptonic side, from an 8th-order to a 4th-order polynomial. We introduce 
a prior for the transverse momentum of the ti system into the overall event probability 
formula, but we do not consider the transverse motion of individual initial partons. This 
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allows us to eliminate two more integration variables. The ti transverse momentum prior, 
as constructed from HERWIG MC samples, is nearly independent of the top quark mass for 
masses between 120 and 220 GeV/c 2 . The transverse momentum of the initial partons is 
also neglected for the determination of Z\ and Z2 from the event kinematics. 

We choose the set of seven remaining variables of integration to be the squared masses M 2 
and Myy on both the leptonic decay and hadronic decay side of the ti system, (3 = \og(p q /pq), 
where p q and p q are the magnitudes of the momenta of the two products from the hadronic 
W boson decay, and the two-dimensional transverse momentum vector p T of the ti system. 
Note that the top quark and W boson pole masses, m t and mw, are not the same variables 
as M t and My/. The latter variables refer to the top quark and W boson masses in a given 
event, and we integrate over them in our likelihood calculation. 

The expected distributions of M 2 and are defined almost exclusively by the top 
quark and W boson propagator terms in the matrix element. Nominally, these are relativis- 
ts Breit-Wigners peaked at the top quark and W boson masses; however, the kinematic 
assumptions to reduce the number of integration dimensions described above cause the M t 2 
and M^r distributions to be altered from their Breit-Wigner form. To account for this, we 
replace the Breit-Wigner propagators in the matrix element with propagators that reflect 
the assumptions; we call these adjusted propagators "effective propagators." 

A. Effective Propagators 

The effective propagators are built by calculating invariant masses of "effective partons." 
These objects are constructed in such a manner that their four-momenta can be reproduced 
exactly using only the integration variables and the variables measured in the detector by 
solving kinematic equations consistent with our assumptions. In each MC event, we find 
the assignment of the four tree-level partons to the four highest pr jets reconstructed in 
the detector that minimizes the combined distance in the rj-cp space between partons and 
jets. Then we construct effective partons by building four-vectors that have the energies 
of the tree-level partons, the directions of the matched calorimeter jets, and the masses 
used in the kinematic equation solvers (i.e., zero for light quarks and the leptonic side b, 
and 4.95 GeV/c 2 for the hadronic side b). We associate effective values of top quark and 
W boson mass, as well as j3 and priti), by building these quantities out of the effective 
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partons. The effective partons are also used in the construction of our calorimeter transfer 
functions, for the sake of consistency. 

The construction of the effective propagator on the leptonic side uses values of the lepton 
momentum smeared according to the Gaussian resolution functions given in Section [III 
°(Pt)/pt = 0.1% -p T /(GeV/c) for muons and a(E T )/E T = 13.5%/ \/ E T /GeV © 2% for 
electrons. 

The choice of the neutrino p z is ambiguous; for each event we solve for the p z that 
minimizes the deviation of the effective leptonic W boson and top quark masses, M^eg and 
M tj eS, from the tree-level W boson mass, Mw, ge n, and top quark mass, M ttgcn . The deviation 
is quantified by a \ 2 defined as 

2 _ (M f 2 eff - M 2 gcn ) 2 (M^ cff -M^ gen ) 2 

where 

a t = T t m t , a w = T w m w , (4) 

m t and rriw are pole masses for the top quark and the W boson, and T t and Tw are their 
decay widths. When this p z search is performed, the transverse momentum of the leptonic- 
side top quark is set to the difference between the tree-level MC value of the tt transverse 
momentum and the effective transverse momentum of the hadronic-side top quark. 

In our calculations we assume that there is no correlation between the effective prop- 
agators on the hadronic and leptonic sides of the event. In reality, the invariant masses 
on the leptonic side are affected by the hadronic side uncertainties due to the definition of 
the leptonic top quark momentum used in the effective propagator construction. However, 
the uncertainty due to the transverse momentum transfer from the hadronic to the leptonic 
side is not large in comparison with the uncertainty associated with the unknown transverse 
momentum of the ti system itself. 

The widening of the effective invariant masses in comparison with the original Breit- 
Wigner distributions depends on the event kinematics. For example, finite angular resolution 
of the detector results in the greatest widening of the effective W boson mass distribution 
on the hadronic side when the opening angle between the two jets originating from the W 
boson decay is close to tt/2. Due to the large dimensionality of the phase space and limited 
CPU resources available, it is not feasible to model the shapes of the effective invariant 
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mass distributions for each kinematic configuration encountered in the calculation of event 
probabilities. Instead, we characterize these shapes using a low-dimensional quantity with 
high predictive power: the covariance matrix for effective W boson and top quark masses. 

On the hadronic side of the event we calculate the appropriate Jacobian and propagate 
the uncertainties from jet masses and angles to M 2 and using the standard, first-order 
multivariate error propagation formulae. We assume that the uncertainties of jet masses 
and angles are not correlated. The angular resolutions calculated with herwig and used 
to build the hadronic-side propagators are shown in Fig. [5j where the angular resolution is 
defined as the width of the At] or A0 distribution in a given parton pt bin. The resolution 
on the jet mass squared used for covariance matrix estimation is assumed to be constant: 
cr(m 2 ) = 242 GeV 2 /c 4 for 6-jets and a(m 2 ) = 202 GeV 2 /c 4 for light jets. (These values are 
also calculated with herwig.) The uncertainties on the magnitudes of the parton momenta 
are not used to build this covariance matrix — these uncertainties are taken into account 
by the transfer functions. 
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FIG. 5: Angular resolution in rj (left) and eft (right) as a function of parton pp. The dashed line 
indicates the resolution for light quarks, and the solid line for b quarks. 

Three independent quantities can be extracted from the covariance matrix constructed 
in this manner: the standard deviations for Mf and and the correlation coefficient. We 
assume that the hadronic side effective propagator depends only on these three quantities. 
However, we do not make any functional assumption about the propagator shape. Instead, 
we use MC events to build a non-parametric estimate of the propagator density. We split the 
3-D space of the two standard deviations and the correlation coefficient into cells that con- 
tain approximately equal numbers of events. In each cell, we use a kernel density estimation 
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technique [25j to construct the effective propagator. The propagator is initially evaluated on 
a sufficiently dense rectangular grid, and fast linear interpolation is used to find its values 
during subsequent calculations. When the event is reconstructed during the likelihood inte- 
gration, the covariance matrix is calculated for the given kinematics, and the corresponding 
effective propagator density is looked up in a table of pre-computed propagators. 

Due to the presence of unobserved neutrino, the kinematic configuration on the leptonic 
side of the event is significantly less constrained than that on the hadronic side, and provides 
less information about the top quark mass and the jet energy scale. Because of this, we 
employ a simplified model for the leptonic side effective propagators. These propagators 
are averaged over various kinematic configurations, and they depend only on the assumed 
top quark pole mass, not on the kinematics of a particular event. (The hadronic side 
propagators depend on the top quark pole mass implicitly, via the Jacobians used in the 
error propagation.) Examples of the hadronic and leptonic side effective propagators are 
shown in Fig. El 
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FIG. 6: Contour plots of the propagators in Myy and AMj. Left: Breit-Wigner propagator, for 
comparison. Center: Sample effective propagator on the hadronic side of the event. Right: Sample 
effective propagator on the leptonic side of the event. 



B. Transfer Functions 

The transfer functions relate the parton transverse momentum, p?, to the measured jet 
momentum. They are probability distributions of p/E, the ratio of the magnitude of the 
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jet momentum p to the parent effective parton energy E, parametrized as a function of the 
transverse momentum of the parent effective parton (described in the previous section). 
We construct our transfer functions using tt — > lepton + jets MC events in a wide range 
of top quark masses, requiring the same selection cuts as described earlier. In this sample, 
the parton is matched to the simulated jets, p/E distributions are created in bins of the 
parent parton p?, and then these distributions are fit with a p^-parametrized function. 



The function is constructed using Johnson curves [26], which allow us to fit a variety of 
non-Gaussian shapes. These curves are parametrized by quantities calculated from the 
transfer function distributions themselves: mean (/i), standard deviation (a), skewness 
(s), and kurtosis (k), which, in turn, smoothly depend on the parton px- We extrapolate 
the fitted transfer functions for momenta that are below the cutoff value imposed in the 
sample. This extrapolation ensures that the transfer functions are correctly normalized, 
as it accounts for jets that do not appear in our sample due to selection cuts. Separate 
transfer functions are created for four different i] regions of the detector, as well as for b 
quarks and light quarks. Figure [7] shows transfer function examples for light and b quarks 
for several different parton p? values and rj bins. 



C. Normalization and acceptance 

The normalization factor N(m t ) in Eq. [2] is obtained by integrating the Kleiss-Stirling 
matrix element 6J together with the PDF's and the flux factor over the phase space formed 
by the two initial and the six final state particles. The resulting cross section as compared 
to the tt cross section in HERWIG is shown in Fig. [HJ We do not expect perfect agreement 
due to the absence of radiation in our code. 

The resulting normalization is then corrected by a small additional factor to account 
for the difference between the effective propagators and Breit-Wigners. The normalization 
correction is calculated as follows. We split the matrix element into two parts: |M(x)| 2 = 
Ftree(^) \M(x)*\ 2 , where x completely specifies the phase space point, r tree (af) is the product 
of the four Breit-Wigners, and |M(x)*| 2 is the rest of the matrix element. Using this notation, 
the tree-level normalization is J F(x)r tree (x) \M (x)*\ 2 dx, where F(x) is the remaining term 
in the cross section (flux factor and structure functions). The correct normalization with 
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FIG. 7: Sample fitted transfer functions for light and b quarks in r/ bins. Transfer functions are 
shown for light quarks with parton Pj- = 40 and 70 GeV/c (top left and right, respectively), and 
for b quarks with parton Pt = 40 and 70 GeV/c (bottom left and right, respectively). 

the effective propagators is instead 



/ 



F(£ eff )r eff (£ eff )|M(£efr)*| dx eS , (5) 

where x e g specifies the kinematic configuration of the effective partons. This quantity can 
be rewritten as 
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r F(x cS )r cS (x cS )\M(x cS y\ 2 dx cS 2 

J F(x)T tme (x)\M(xy\ 2 dx nx)T^{x)\M{x) \ dx. (6) 

From this point forward we proceed as if there is a one-to-one correspondence between 
points in x and x e g [33(. By construction, the propagators play the role of densities in their 
corresponding spaces. Therefore, T e g(x^)dx e fi = T tree (x)dx, and we need only to calculate 



This quantity is the average value of 



F(x eS )\M(x efi )*\ 2 
F(x)\M(x)*\ 2 ' 1 j 

calculated over the tree-level MC events, times the cross section. The resulting correction 
factor is plotted in Fig. M as a function of the top quark mass. 




FIG. 8: Left: Cross section obtained from our normalization calculation as a function of mt 
compared with the cross section used in herwig. Right: Normalization correction factor due to 
effective propagators as a function of m t . 



The acceptance A(m t , JES) is obtained from ti MC events in which parton angles are 
randomized to simulate the small angular resolution uncertainty of the detector, and parton 
momenta are smeared according to our transfer functions to mimic the jet momenta that 
would have been measured in the detector. The kinematic distributions for the smeared 
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events are similar to those of fully simulated events. We do this for all the values of the 
top quark mass and JES over which the likelihood function is defined, and then calculate 
the acceptance at each m t and JES value to be the fraction of these MC events that pass 
our selection cuts. The advantage of this approach as opposed to using fully simulated 
MC events is that the jet-parton association is exact, and events with incorrect jet-parton 
association can be excluded from the efficiency calculations. Our probability model describes 
tree-level signal events with the correct set of jets; therefore, we do not use fully simulated 
events, which include effects not accounted for in our model, such as gluon radiation. The 
transfer functions are normalized with respect to all jet momenta, not just those which 
pass the cuts. By building an acceptance function from events smeared according to our 
transfer functions, we directly normalize our likelihood. Furthermore, we can generate our 
acceptance from a much larger sample of events because we avoid the computing intensive 
steps of event simulation and reconstruction, while reducing statistical fluctuations in the 
resulting curve. Figure [9] shows the 2-D acceptance as a function of m t and JES. 




FIG. 9: Acceptance used in the integration as a function of mt and JES. 
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V. BACKGROUND DISCRIMINATION AND FINAL LIKELIHOOD 



Our integration method calculates the likelihood for an event assuming the ti hypothesis, 
hence we use a neural network approach to discriminate against background events. The 
neural network uses ten inputs: seven variables describing the kinematics of the event (the 
Pt for the four leading jets, the lepton Et, Et, and Ht, the scalar sum of these quantities), 
and three variables describing the topology of the event. The three topological variables 
are aplanarity, Dr, and i^TZ- The aplanarity is defined as 3/2Qi, where Q\ is the smallest 
eigenvalue of the normalized momentum tensor = YliPaPb/ \Pi\ 2 > where the indices 
a and b run over the three axes x, y, and z, the index i runs over the four jets and charged 
lepton, and Pi is the three-momentum of a given particle. The variable Dr = Ai?™ m • 
min(pz )/Pti where Ai?™ in is the smallest AR between any pair of jets, mm{pi'^) is the 
smaller of the two p z values for the two jets in that pair, and p l T is the transverse momentum 
of the charged lepton. H^z is a ratio of scalar sums of transverse and longitudinal momenta; 
the numerator contains all jets except the leading jet, and the denominator sums all jets, the 
charged lepton, and the neutrino. The smaller \p"\ solution given by the kinematic equation 
for the leptonic W boson decay (assuming M w = 80.4 GeV/c 2 ) is taken. The ten variables 
are summarized in Table [TTJ To construct the neural network, we use the jetnet neural 
network package, version 3.5 27]. 

The neural network is trained to separate ti events with a mass of 170 GeV/c 2 from 
W + bb background; we then cross-check the neural network with other signal masses and 
background types to make sure that the output shape is not dependent on the signal mass 
present. Figure [TU] shows the neural network output, q, for a variety of different samples. We 
compute the background fraction for each observed event as /b g (<?) = B(q) /[B(q) + S(q)], 
where the background distribution B(q), obtained by adding each type of background with 
its own weight, and signal distribution S(q) are each normalized to their overall expected 
fractions. 

Our total likelihood from all events will naturally contain likelihoods from signal events 
and background events. However, only the signal events will contain meaningful informa- 
tion about m t . Thus, we want to remove the contribution due to background events from 
the total likelihood to recover the likelihood from signal events. (Note that there is not a 
separate matrix element for background processes — the likelihood for all events, signal and 
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TABLE II: Variables included in the neural network discriminant: the first seven are kinematic 
variables, the last three are topological variables. 



Variable 



Definition 



P T 

E l T 

Et 

Aplanarity = 3/2 ■ Qi 



D R = ARf™ 



min(pi M) )/PT 



Pt of each of the 4 leading jets 

Charged lepton Et (electron) or px (muon) 

The missing Et 

Scalar sum of jets and lepton transverse momenta and Et 
Q±: smallest eigenvalue of the momentum tensor 
Ai?™ m is the smallest AR between any pair of jets 



Htz = Ylt=2 \Pr\/(Yli=i \Pz\ + \Pz\ + \Pz\) Ratio of scalar sums of transverse to longitudinal momenta 



o 

30.12 



S 0.1 
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FIG. 10: The distributions of our neural network discriminant variable for signal and background 
MC events. The peaks on the right are from tt signal events at three top masses. The peaks on 
the left are from various types of backgrounds. 

background, is calculated under the assumption that the event is a tt signal event.) Con- 
sequently, we compute from Monte Carlo simulation the average likelihood for background 
events and subtract out the expected contribution due to background events from the total 
likelihood: 
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logL mod (m t , JES) = 22 ^°S L (yi\ m t, JES)] - n hg log L hg (m t , JES), (9) 

86 events 

where L mod is the modified total likelihood for a given set of events, L($|m t , JES) is the 
likelihood for an individual event, n^ g the expected number of background events, and 
Lbg{tTLt, JES) is the average likelihood for a background event as computed in Monte Carlo 
simulation. This calculation is performed separately for 1-tag and >l-tag events, as the 
background fractions and Lb g (m 4 , JES) are different for the two subsamples. 

We can rewrite Eq. [9] in terms of the individual per-event background fraction to obtain 
our final modified likelihood L mod : 

logL mod (m t , JES) = ^ [\ogL(yl\m t , JES) — \ogL hg (m t , JES)], (10) 

iGcvcnts 

where ,/bg (<??.) is the background fraction given the discriminant variable g« for a given event. 
Equations [9] and [TO are equivalent if the number of background events in the data is equal 
to the expected background contribution. However, the advantage of using Eq. [TO is that if 
there are more or fewer background-like events in our data than expected, the average value 
of /bg(<2i) will be correspondingly higher or lower, thus compensating for the difference. 

There is another class of events not well-modeled by our signal likelihood integration or 
handled by the background subtraction above, which we call "bad signal" events. These are 
tt signal events in which the four observed jets and/or lepton are not directly produced from 
the tt decay. These events exist due to a variety of causes (extra jets from gluon radiation, 
tt events where both W bosons decay leptonically or hadronically, W — > tv decay, etc.) and 
comprise roughly 35% of our total signal. For a signal mass of 172 GeV/c 2 , 36.2% of the 
single-tag and 30.9% of the > 1-tag events fall into the "bad signal" category. 

We observe that the peaks of the likelihood curves for these "bad signal" events tend to be 
generally lower than the peaks for well-behaved tt events. Figure [TT] shows the distribution 
of the peak value of the likelihood curves for "good signal" , "bad signal" , and background 
events. We adopt a cut on the peak value of the likelihood of 6, which retains only the bins 
dominated by "good signal" . Table II III shows the efficiency of this cut for "good signal" 
events, "bad signal" events, and background events for m t = 172 GeV/c 2 . With this cut we 
remove ~22% of the "bad signal" events and ~29% of the background events while retaining 
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FIG. 11: Distributions of the peak value of the log-likelihood for MC events, divided into "good 
signal", "bad signal", and background events. Left: 1-tag events, right: multiple-tag events. The 
vertical lines indicate where the likelihood cut is applied. 

~95% of "good signal" events. While this cut reduces the size of our sample, the overall 
resolution is significantly improved due to the improved validity of our assumptions about 
the sample. 

TABLE III: Efficiency of the likelihood cut at a value of 6 for mt = 172 GeV/c 2 . The uncer- 
tainties shown are the binomial uncertainty on the cut, summed appropriately across the different 
background types for the background. 

Type of event 1-tag > 1-tag 

Good signal 93.6% ± 0.4% 96.9% ± 0.4% 

Bad signal 76.8% ± 0.9% 77.5% ± 1.5% 

Background 70.4% ± 0.5% 68.9% ± 1.3% 



Our calculation gives us a 2-dimensional joint likelihood as a function of m t and JES. We 
treat the JES as a nuisance parameter and eliminate it using the profile likelihood, i.e., we 
take the maximum value of the likelihood along the JES axis for each m t value. That is: 



L pTO {(m t ) = max L(m t ,j) (11) 
jeJES 
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This gives us a 1-D likelihood curve in m t only. We then follow the normal procedure of 
taking the position of the maximum likelihood as our reconstructed mass and descending 
1/2 unit of log-likelihood from the peak to determine the estimated uncertainty. Because of 
imperfections in our model, these quantities need to be calibrated in order to obtain a final 
measured mass and uncertainty. 

VI. TEST OF THE METHOD AND CALIBRATION 

We test our method using MC samples of fully simulated and reconstructed tt events and 
the background samples described in Section IIHI We construct pseudoexperiments (PEs) 
from the MC samples with an average total number of events equal to the number observed 
in the data. As shown in Tabled] we observe 371 events, of which 70.3 ± 16.5 are expected 
to be background. After applying the likelihood cut efficiencies for signal and background 
simulated events, we expect a total of 303 events, which is the number of events that we 
use in our PEs. The number of each type of event (signal and each background type) is 
Poisson-fluctuated about its expected contribution to the total. We perform 2000 PEs for 
each signal top quark mass value and compute the resulting average reconstructed mass, 
bias, expected statistical uncertainty, and pull width. Figure [12] shows the reconstructed 
mass, bias, and pull width versus the input top quark mass, where the bias is defined as the 
difference between the true mass m true and the reconstructed mass m rec and the pull width 
is the width of the distribution of (m rec — m true )/(cr m ) rec in individual PEs, where (a m ) rec 
is the estimated uncertainty. The output mass is a linear function of the input mass with 
a slope very close to 1; the mass bias and the pull width are independent of the input top 
quark mass. The non-zero bias and non-unit pull width are due to the presence of events 
not modeled in our effective propagator model ( "bad signal" events and background) in our 
analysis; if we run PEs on "good signal" events only, we obtain a bias and average pull width 
consistent with and 1, respectively. 

We use the observed slope, bias, and pull width to calibrate our method. Specifically, 
the reconstructed mass is calibrated by correcting for the measured bias and slope, and the 
estimated uncertainty is calibrated by correcting for the measured pull width and slope. 
We perform our calibrations in terms of Am t = m t — 172 GeV/c 2 ; our overall calibration 
formulae to obtain a calibrated mass Am ca i and mass uncertainty (a m ) ca \ in terms of the 
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FIG. 12: Pseudoexperiment results using fully simulated signal and background events after ap- 
plying a likelihood cut, with a mean of 303 events for each PE. For these samples, JES is fixed at 
its nominal value of 1. Top left: reconstructed vs. input top quark mass; top right: bias vs. input 
top quark mass; bottom: pull width vs. input top quark mass. 

observed mass Am„b s and uncertainty (tr m ) bs are: 



where ci, c 2 , and c 3 are the slope in the upper-left plot, the constant in the upper-right 
plot, and the constant in the bottom plot of Fig. [121 respectively. Using the fits shown in 
these plots, we obtain Cl = 0.995 ± 0.006, c 2 = -1.09 ± 0.06 GeV/c 2 , and c 3 = 1.20 ± 0.01. 

The 2-D likelihood method measures JES in situ in the ti sample. To ensure that this 
method correctly handles events where the JES is not necessarily equal to its nominal value, 
we also check simulated samples where the JES has been shifted from its nominal value 
of unity. Specifically, we use four different JES shifts: JES = 0.95, 0.97, 1.03, and 1.05, 




Am, 



cal 



(Am obs - c 2 )/c!, 



(12) 
(13) 
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to obtain a calibration for our JES measurement in the same way that we calibrate our 
m t measurement above. Figure [13] shows the reconstructed JES, JES bias, and JES pull 
width versus the input JES for m t = 170 GeV/c 2 . The non- unity pull width for JES is 
due to the same origin as the non-unity pull width for the top mass. We use these results 
to obtain our calibration for the reconstructed JES. We perform our calibrations in terms 
of A(JES) = JES — 1, yielding the final formulae for our calibrated A(JES), A(JES) ca i, 
and JES uncertainty (ajEs)cai in terms of the observed value, A(JES) b s , and uncertainty 

( cr JEs)obs : 

A(JES) cal = (A(JES) obs - c 5 )/c 4 (14) 

(ojEs)cal = (VjEs)obs X C 6 /c 4 , (15) 

where c 4 , c 5 , and c 6 are the slope in the upper-left plot, the constant in the upper-right 
plot, and the constant in the bottom plot of Fig. [131 respectively. Using these results, we 
obtain c 4 = 1.03 ± 0.04, c 5 = 0.0003 ± 0.0013, and c 6 = 1.17 ±0.01. 

Since the m t and JES calibration parameters are derived for JES fixed at 1 and mt fixed 
at 170 GeV/c 2 , respectively, we also need to ensure that they do not vary for different values 
of m t and JES. Figure [T41 shows the results of these studies. We note that the JES and m t 
slope and bias do not noticeably change for different m t and JES inputs. The plot on the 
bottom also shows that the reconstructed top quark mass is very stable with respect to the 
input JES, showing that our procedure of independent calibration of the two variables is 
valid. 

VII. SYSTEMATIC UNCERTAINTIES 

We discuss the systematic uncertainties on our measurement in this section. Most sys- 
tematics are evaluated using a general procedure where we shift a given quantity by ±1<t 
in the signal MC sample and/or in the background samples, perform PEs to measure a top 
quark mass on the shifted sample, and use the resulting shift in the measured top quark mass 
as our systematic uncertainty. In this paper, we have rounded all of the final systematic 
uncertainties to a precision of 0.1 GeV/c 2 ; although some systematics are known to a higher 
precision than this, some are not. 
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FIG. 13: JES pseudoexperiment results using fully simulated signal and background events after 
applying a likelihood cut, with a mean of 303 events for each PE. The top quark mass here is fixed 
at mt = 170 GeV/c 2 . Top left: reconstructed vs. input JES; top right: JES bias vs. input JES; 
bottom: JES pull width vs. input JES. 



• Calibration: In calibrating our final result, we use the bias and slope constants ci, 
c 2 , and c 3 as described in Section [VTl The uncertainty on these constants is a source of 
systematic uncertainty in our measurement. The uncertainty in c\ is the major source 
of our final quoted uncertainty of 0.1 GeV/c 2 . 

• Residual JES: Although the 2-D measurement is designed to capture any changes in 
the JES, we assume a constant factor for the jet energy scale, whereas the jet energy 
systematic uncertainties depend on the jet px and rj. Furthermore, the JES uncertain- 
ties are composed of the sum of several different potential sources, each of which may 
vary differently. To evaluate potential systematic uncertainties due to this assump- 
tion, we shift the jet energies by one standard deviation for each source of systematic 
uncertainty (corrections for relative response of different sections of the calorimeter, 
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FIG. 14: Results of studying samples with shifted JES values. Top left: Mass bias vs. input top 
quark mass at the five different JES points; top right: JES bias vs. input JES at the three different 
mass points; bottom: reconstructed top quark mass vs. input JES for the three different top quark 
masses. 

the absolute corrections in cone with AR = 0.4, out of cone correction details, and 



other minor sources as described in Ref. 



28(|) . We perform these tests shifting only 



the signal, and also shifting the signal together with the W + bb background, and take 
the greater of the resulting differences. The resulting shifts are added in quadrature 
to obtain our residual JES systematic uncertainty of 0.5 GeV/c 2 . 

6-JES: We have assumed that the JES is the same for all jets. However, there is 
an additional uncertainty arising from relative differences between b- and light-quark 
jets. (Note that the jet systematic uncertainties are predominantly determined using 
light jets.) We identify three sources of uncertainty: one due to the uncertainty 
in the semileptonic decay ratio, which we estimate by varying this ratio by ±lcr; 
one due to the uncertainty in the 6-fragmentation modeling, which we evaluate by 
varying the parameters used in the Bowler fragmentation model in the PYTHIA 
Monte Carlo generator, using two different sets of parameters derived from SLD and 
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LEP results [30]; and one due to uncertainty in the calorimeter response for fe-jets 
compared to light-quark jets. Because the calorimeter response is determined in light- 
jet samples, the different charged particle fraction and momentum spectrum in 6-jets 
could result in a different response. We evaluate this uncertainty by checking the effect 
of the calorimeter corrections in simulated light jets and 6-jets separately. Then we 
propagate this difference in the response by shifting the Et of jets identified as 6-jets 
in the ti Monte Carlo sample. The three corrections to the 6-jet energy scale yield 
uncertainties of 0.1 GeV/c 2 , 0.3 GeV/c 2 , and 0.1 GeV/c 2 , respectively, for an overall 
uncertainty of 0.3 GeV/c 2 . 

Generator: We evaluate a systematic due to the MC generator used by comparing the 
results from HERWIG and pythia ti samples. We take the resulting difference of 0.6 
GeV/c 2 as our systematic uncertainty. There is also a potential systematic uncertainty 



for color reconnection effects not included here |3l|]; current studies suggest that these 



may not significantly increase our total systematics. 

ISR and FSR: Systematic errors due to initial-state radiation (ISR) and final-state 
radiation (FSR), where additional gluons are radiated, are evaluated using MC sam- 
ples. A control sample of Drell-Yan events allows us to compare the dilepton invari- 
ant mass spectrum in data and MC and estimate the uncertainty on the parameters 
controlling the radiation [281 ] . Those parameters have been varied by the estimated 
uncertainty to study the effect on the top mass measurement. In this case the un- 
certainty on the measured top mass shift is larger than the shift itself, so we use the 
uncertainty of 0.3 GeV/c 2 as our quoted uncertainty. 

PDFs: We evaluate the systematic uncertainty due to the parton distribution func- 
tions (PDFs) used in the matrix element integration by comparing different PDF sets 



(CTEQ5L 2^1 an d MRST72 32] ), varying a s , and varying the eigenvectors of the 
CTEQ6M PDFs. The final uncertainty is defined by the sum of the eigenvector uncer- 
tainty in quadrature and the a s difference. The CTEQ-MRST difference in our case 
is negligible. The result is an overall uncertainty of 0.4 GeV/c 2 . 

Background: There are several uncertainties associated with our background sub- 
traction. First is the uncertainty due to uncertainty on the overall background frac- 
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tion. The second source is the uncertainty in the background composition, which we 
assess by setting the background to, in turn, 100% W + bb, W + cc/c, W + light, 
or QCD background and taking the largest resulting shift as our uncertainty. Third, 
the uncertainty associated with our average background likelihood Lb g (m 4 ,JES) as 
described in Section |Vj to evaluate this uncertainty, we divide the sample into two 
disjoint subsamples (one with only electrons, and one with only muons), build the 
average background likelihood curve from one subsample, and measure the top quark 
mass using the other subsample. Finally, we account for uncertainties due to the Q 2 
scale used by the background MC generator. The resulting systematic uncertainties 
are 0.3 GeV/c 2 , 0.4 GeV/c 2 , 0.3 GeV/c 2 , and 0.2 GeV/c 2 , respectively. 

• Lepton px' To account for the 1% uncertainty on the measured lepton p?, we apply 
our method to samples where the lepton pt has been shifted by this amount, resulting 
in an uncertainty on the top quark mass of 0.1 GeV/c 2 . 

• Permutation weighting: We account for a potential systematic for the tagging 
probabilities used to weight our permutations (the Wi factors in Eq. [2]), since these are 
derived from fits to the tagging probabilities measured in data. We estimate that the 
predominant source of uncertainty in this estimate is the ratio of charm tags to 6-tags, 
which is nominally 22%. We vary this by its relative uncertainty of 15% and measure 
the resulting difference, which is negligible. 

• Pileup: We consider two sources of uncertainty due to multiple pp interactions. First, 
we consider the fact that the number of interactions in our Monte Carlo samples is 
not equal to the number observed in the data. To estimate this effect, we divide 
our Monte Carlo samples into subsamples with different number of interactions in 
the event, examine the slope of the resulting measured top quark mass as a function 
of the number of interactions, and multiply this by the difference in the number of 
interactions between Monte Carlo events and data events. Second, we consider the 
modeling of the additional interactions in an event. Our current model is derived from 
minimum bias events, so we consider the possibility that it does not correctly model 
tt events. For this purpose, we compare the observed jet response as a function of 
the number of vertices in tt Monte Carlo events and minimum bias data and use the 
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resulting difference to obtain a systematic uncertainty. We take the larger of these 
two sources, 0.2 GeV/c 2 , as our systematic uncertainty. 

• Gluon fraction: herwig and pythia are both leading-order MC generators, so it 
events in these samples are approximately 95% produced from qq annihilation and 
5% produced from gg fusion. However, NLO expectations are closer to (15 ± 5)% gg 
production. To check for a potential systematic due to this effect, we run PEs where 
qq and gg events are reweighted so that the qq weights sum to 0.80 and the gg events 
to 0.20 (using the maximal gg percentage to be conservative) and use the resulting 
shift of 0.3 GeV/c 2 as our uncertainty. 

Table IIVI summarizes our final list of systematic uncertainties. 

TABLE IV: Total list of systematic uncertainties. 



Systematic source 


Systematic uncertainty (GeV/c 2 ) 


Calibration 


0.1 


Residual JES 


0.5 


6-JES 


0.3 


MC generator 


0.6 


ISR and FSR 


0.3 


PDFs 


0.4 


Background: fraction 


0.3 


Background: composition 


0.4 


Background: average shape 


0.3 


Background: Q 2 


0.2 


Lepton pt 


0.1 


Pileup 


0.2 


Gluon fraction 


0.3 


Total 


1.2 
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VIII. RESULTS 



In the data we find a total of 318 events which pass all of our selection requirements 
(including the likelihood peak value cut), of which 237 have exactly 1 tag and 81 have more 
than 1 tag. We combine the likelihoods for the 1-tag and >l-tag subsamples using the 
formula described in Eq. [10] and then combine the two subsamples. After obtaining this 
total likelihood, we use the profile likelihood method introduced in Section [V] to extract a 
top quark mass value. That mass value is then corrected using the calibration procedure to 
obtain a mass value of 172.7 ± 1.8 GeV/c 2 . The left plot in Fig. [[5] shows the resulting 2-D 
likelihood contours after calibration for l-a, 2-a, and 3-a uncertainties, assuming that they 
have Gaussian distributions. 

To validate the likelihood cut used in our procedure, we compare distributions of the event 
peak likelihoods in data and MC events. The K-S confidence level for these two distributions 
is 93.8%, indicating a very good agreement between the data and MC simulations. 
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FIG. 15: Left: Measured 2-D likelihood on the data events. The plot shows the contours corre- 
sponding to a 1-cr, 2-a, and 3-a uncertainty (assuming Gaussian behavior) in our measurement. 
The calibration derived from MC events has been applied to both axes. The marker shows the 
point of maximum likelihood. Right: Likelihood peak position of the individual likelihood curves 
for data and MC events. The dashed line indicates the likelihood cut of 6 employed. 

This result combines the statistical uncertainty and the uncertainty due to JES. To 
separate these two contributions, we fix the JES value to be 1.0 and evaluate the uncertainty 
in the resulting 1-D likelihood as a function of m t . This yields an uncertainty of 1.2 GeV/c 2 . 
We conclude that the remaining uncertainty of 1.3 GeV/c 2 is due to the JES. The expected 
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statistical + JES uncertainty from MC events at a top quark mass of 172 GeV/c 2 peaks at 
1.8 GeV/c 2 , in good agreement with the measured uncertainty in the observed event sample 
of 1.8 GeV/c 2 ; 50% of pseudoexperiments show a smaller uncertainty than that measured in 
the data. The distribution of the expected uncertainties is shown in Fig. [TBI The additional 
systematic uncertainty on the measured top mass discussed in Section [VIII is 1.2 GeV/c 2 , 
yielding a final result of: 

m t = 172.7 ± 1.2 (stat.) ± 1.3 (JES) ± 1.2 (syst.) GeV/c 2 
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FIG. 16: Expected statistical uncertainty (including uncertainty due to JES) on the top mass from 
the 2-D profile likelihood method, derived from MC events with nit = 172 GeV/c 2 . The black 
arrow indicates the uncertainty in the data measurement. All uncertainties have been scaled by 
the average pull width of 1.198. They are also corrected by 1/0.995 to account for a measured 
response slope slightly different from 1. 

From the 2-D likelihood, we can also obtain a JES measurement by using the profile like- 
lihood technique to eliminate the mt axis and applying the calibration described previously. 
This yields a measurement of JES = 1.008 ± 0.013, indicating that the JES is well within 
its expected value. Note that the profile likelihood measurement technique does not impose 
any prior expectations on our JES value. 
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IX. CONCLUSION 



We have used 1.9 fb _1 of data at the Tevatron to measure the mass of the top quark 
using the lepton + jets topology. Our analysis uses a modified matrix element integration 
technique. To date, analyses using this technique have made simplifying kinematic assump- 
tions for the purposes of computational tractability. The method described in this paper 
includes the first attempt to compensate directly for these assumptions. Our measured top 
quark mass with 318 events passing all our selection criteria is: 

m t = 172.7 ± 1.8 (stat. + JES) ± 1.2 (syst.) GeV/c 2 
or combining statistical and systematic uncertainties (assuming Gaussian behavior) 

m t = 172.7 ±2.1 (total) GeV/c 2 . 

Our model at the moment does not take into account events where a jet from top quark 
decay is missing or is replaced by a jet from the parton shower. Proper treatment of these 
"bad signal" events should help improve the measurement. 
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